Add CMakePresets for target micro arch by AntoinePrv · Pull Request #1348 · xtensor-stack/xsimd

AntoinePrv · 2026-05-13T16:14:09Z

I've taken the direction of explicit flags such as -mavx -mno-avx2.
This is IMHO less error-prone and more accurate than using an architecture name such as haswell.
The main difference is that this does not add other feature flags or change the -mtune model.
For a test setting, accuracy is more important IMHO.
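
As a rough illustration of the explicit-flag approach described above, the sketch below builds a flag string from a list of features to enable and a list of features to prune. The helper name `arch_flags` is hypothetical and is not part of xsimd or of this PR; the only assumption is the `-m<feature>` / `-mno-<feature>` spelling used by GCC and Clang.

```shell
# Hypothetical helper (not from the PR): build explicit ISA flags.
# Usage: arch_flags "<features to enable>" "<features to disable>"
# Written for plain POSIX sh, so the loop variables are global.
arch_flags() {
    flags=""
    for feat in $1; do flags="$flags -m$feat"; done      # enable explicitly
    for feat in $2; do flags="$flags -mno-$feat"; done   # prune explicitly
    echo "${flags# }"                                    # drop the leading space
}

# AVX without AVX2, matching the example flags in the comment above.
avx_only=$(arch_flags "avx" "avx2")
echo "$avx_only"
```

Unlike an architecture name, a list like this says exactly which feature is on and which is off, and it leaves the tuning model untouched.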

serge-sans-paille · 2026-05-14T07:27:52Z

I really like your approach and will eagerly merge it once it validates \o/

AntoinePrv · 2026-05-14T08:07:25Z

I've only kept the micro architecture target in CMakePresets.json because combining it with (debug/release) / (xtl on/off)... results in a combinatorial explosion of presets, for which there is currently no support.
Another shortcoming is that we cannot select flags based on the compiler here, which MSVC flags would require. We can do it based on the OS, but it is not quite the same.
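
To make the combinatorial explosion mentioned above concrete, here is a small sketch that counts how many presets a full cross product would need. The three axes and their values are assumptions chosen for illustration; they are not the actual list of presets in this PR.

```shell
# Hypothetical axes; the architecture list is illustrative only.
archs="sse2 sse4.2 avx avx2 avx512f"
build_types="debug release"
xtl_modes="xtl-on xtl-off"

preset_count=0
for arch in $archs; do
    for build in $build_types; do
        for xtl in $xtl_modes; do
            # Each combination would need its own named preset entry.
            preset_count=$((preset_count + 1))
        done
    done
done
echo "$preset_count presets for 5 architectures"
```

With only five architectures and two binary options, this already yields 20 entries, and every new axis multiplies the total again.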

I have ongoing work to do the same as these presets at the CMake level, with a function that can be made available to users to help with the tooling for dynamic dispatch (our current solution in Arrow is very verbose).
In this case, we'd also need to define a safe -march baseline. The reason is that the code in these translation units might also include non-SIMD code (this is sometimes the case in Arrow). With very advanced instruction sets, we're then leaving perf on the table by keeping an x86-64 baseline. But what would be a reasonable baseline when dynamically dispatching to, for example, avx2?

haswell (first avx2) also has fma3 and bmi2
-march=haswell -mno-fma3 -mno-bmi2 if that is a thing?
Or go further back? sandybridge (first avx)? nehalem (first sse4.2)

AntoinePrv · 2026-05-20T12:09:31Z

@serge-sans-paille this is in a ready state, but I am not fully happy with it.

Getting into AVX512 and AVX512-256, the combinatorial explosion of possibilities starts to show again.
Inheritance of flags from other settings is also not possible.

This reinforces my belief that I should keep on with the work to do it in CMake (which could also be installed for our users to improve our dynamic dispatch tooling), and also homogenize it with the test TARGET_ARCH var.

This PR is not completely worthless though. For example, we now have the possibility to really test with avx512f alone, which was not the case before because no Intel arch is limited to the f feature only.

What do you think? Should we give this some mileage before I get the time to work on a CMake solution?

serge-sans-paille · 2026-05-23T06:51:01Z

-        fi
        if [[ '${{ matrix.sys.flags }}' == 'i386' ]]; then
-          CXX_FLAGS="$CXX_FLAGS -m32"
+          export CXXFLAGS="$CXXFLAGS -m32"


AntoinePrv · 2026-05-25

Yes, there is a weird mismatch in master. Both CXX_FLAGS and CXXFLAGS were set, but only CXX_FLAGS was explicitly passed to CMake. CXXFLAGS is picked up automatically, but it was not exported.
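
The effect of the missing export can be reproduced without CMake at all. The sketch below is a minimal demonstration, assuming a POSIX shell; the child `sh -c` process stands in for any tool that reads CXXFLAGS from its environment, and the `-m32` value is taken from the diff above.

```shell
# Start from a clean state: unset also clears any inherited export attribute.
unset CXXFLAGS

CXXFLAGS="-m32"                                   # set, but NOT exported
not_exported=$(sh -c 'echo "${CXXFLAGS:-<unset>}"')

export CXXFLAGS                                   # now part of the environment
exported=$(sh -c 'echo "${CXXFLAGS:-<unset>}"')

echo "without export: $not_exported"
echo "with export:    $exported"
```

A child process only sees the variable once it is exported, which is why the `export` in the diff matters for anything that reads CXXFLAGS from the environment, such as the `$env{CXXFLAGS}` reference in the preset.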

serge-sans-paille · 2026-05-23T06:54:47Z

+        {
+            "name": "avx2",
+            "cacheVariables": {
+                "CMAKE_CXX_FLAGS": "$env{CXXFLAGS} -march=x86-64-v2 -mno-sse4a -mavx2 -mno-avx512f"


We sometimes fall back from avx2 instructions to sse instructions. How can this work?

I do understand the need to prune higher instruction sets, but not the need to prune lower ones, please explain.

AntoinePrv · 2026-05-25

Do you mean -mno-sse4a? This can be removed; I added it when trying to debug a -march=native that was added in the absence of TARGET_ARCH.

Though it is not a problem here: sse4a is an AMD extension that was never implemented on Intel (and that is why it was failing in SDE).

serge-sans-paille

All looks good, except the question on pruning lower architectures which raises a big unknown to me.

serge-sans-paille reviewed May 14, 2026

Comment thread .github/workflows/linux.yml Outdated

AntoinePrv force-pushed the cmake-presets branch from e3bc5ad to cc1883e Compare May 14, 2026 07:52

AntoinePrv force-pushed the cmake-presets branch 4 times, most recently from 0d3d6ea to c573d9d Compare May 20, 2026 07:17

serge-sans-paille reviewed May 23, 2026

Comment thread .github/workflows/linux.yml

serge-sans-paille reviewed May 23, 2026

Comment thread .github/workflows/linux.yml

serge-sans-paille reviewed May 23, 2026

Comment thread CMakePresets.json

serge-sans-paille reviewed May 23, 2026

serge-sans-paille requested changes May 23, 2026

Add CMakePresets.txt

65ab60b

AntoinePrv force-pushed the cmake-presets branch from 7ad91b2 to 65ab60b Compare May 25, 2026 08:02

AntoinePrv added 4 commits May 25, 2026 10:53

Safer trilival-auto-var-init flag

ef1f178

Fix ambiguity

4fcb0fd

Bump SDE target

c4c12c5

Simplify common_memory load_masked

7a256e5

AntoinePrv force-pushed the cmake-presets branch from 2969f40 to 7a256e5 Compare May 25, 2026 11:13

AntoinePrv added 4 commits May 25, 2026 13:24

Fix avx2

ca1f529

Fix avx2

1605e31

fix sse call

f88e98a

Fix reinterpre cast

ec071a8

AntoinePrv force-pushed the cmake-presets branch from f85bb44 to ec071a8 Compare May 25, 2026 11:54

Fix more

81879b7

AntoinePrv force-pushed the cmake-presets branch from 5a393b4 to a63be74 Compare May 25, 2026 13:18

Fix batch_bool

e91290d

AntoinePrv force-pushed the cmake-presets branch from a63be74 to e91290d Compare May 25, 2026 14:00

Fix batch_bool

e368737

AntoinePrv mentioned this pull request May 25, 2026

fix: avx512vl masked load/store #1353

Open

AntoinePrv wants to merge 12 commits into xtensor-stack:master from AntoinePrv:cmake-presets
